Applications to Subset Selection for MVE Estimator and Multiclass Discrimination
نویسندگان
چکیده
The MVE estimator is an important estimator in robust statistics and characterized by an ellipsoid which contains inside 100β percent of given points in IR n and attains the minimal volume, where β ∈ [0.5, 1.0) is of usual interest. The associated minimal ellipsoid can be considered as a generalization of the minimum volume covering ellipsoid which covers all the given points since the latter ellipsoid corresponds to the former one with β = 1.0. Though the computation of the minimal covering ellipsoid is tractable, that of the MVE estimator or, equivalently, the associated minimal ellipsoid is cumbersome since it has a nonconvex structure in general. In this paper, we present a new formulation for constructing an ellipsoid which also generalizes the notion of the minimum volume covering ellipsoid on the basis of the CVaR minimization technique which is proposed by Rockafellar and Uryasev [13]. In contrast to computing the MVE estimator, the proposed ellipsoid construction is formulated as a convex optimization and an interior point algorithm for the solution can be developed. In addition, the optimization gives an upper bound of the volume of the ellipsoid associated with the MVE estimator, which fact can be exploited for approximate computations of the estimator. Also, potential applicability of the new ellipsoid construction is discussed through two statistical problems: 1) robust statistics computations including outlier detection and the computation of the MVE estimator; 2) a multiclass discrimination problem, where the maximization of the normal likelihood function is characterized in the context of the ellipsoid construction. Numerical results are given, showing the nice computational efficiency of the proposed interior point algorithm and the capability of the proposed generalization.
منابع مشابه
Conditional Minimum Volume Ellipsoid with Applications to Subset Selection for MVE Estimator and Multiclass Discrimination
In this paper, we present a new formulation for constructing an ellipsoid which generalizes the computation of the minimum volume covering ellipsoid, based on the CVaR minimization technique proposed by Rockafellar and Uryasev (2002). The proposed ellipsoid construction is formulated as a convex optimization and an interior point algorithm for the solution can be developed. In addition, the opt...
متن کاملPositive-Shrinkage and Pretest Estimation in Multiple Regression: A Monte Carlo Study with Applications
Consider a problem of predicting a response variable using a set of covariates in a linear regression model. If it is a priori known or suspected that a subset of the covariates do not significantly contribute to the overall fit of the model, a restricted model that excludes these covariates, may be sufficient. If, on the other hand, the subset provides useful information, shrinkage meth...
متن کاملAn Overview of the New Feature Selection Methods in Finite Mixture of Regression Models
Variable (feature) selection has attracted much attention in contemporary statistical learning and recent scientific research. This is mainly due to the rapid advancement in modern technology that allows scientists to collect data of unprecedented size and complexity. One type of statistical problem in such applications is concerned with modeling an output variable as a function of a sma...
متن کاملDiscrimination of Power Quality Distorted Signals Based on Time-frequency Analysis and Probabilistic Neural Network
Recognition and classification of Power Quality Distorted Signals (PQDSs) in power systems is an essential duty. One of the noteworthy issues in Power Quality Analysis (PQA) is identification of distorted signals using an efficient scheme. This paper recommends a Time–Frequency Analysis (TFA), for extracting features, so-called "hybrid approach", using incorporation of Multi Resolution Analysis...
متن کاملMulticlass SVM-RFE for product form feature selection
Various form features affect consumer preference regarding product design. It is, therefore, important that designers identify these critical form features to aid them in developing appealing products. However, the problems inherent in choosing product form features have not yet been intensively investigated. In this paper, an approach based on multiclass support vector machine recursive featur...
متن کامل